CDS

Accession Number TCMCG075C03540
gbkey CDS
Protein Id XP_017981808.1
Location complement(join(32881019..32881159,32881251..32881286,32881381..32881494,32881779..32881997,32882325..32882450,32882538..32882612,32882695..32882699,32882810..32882966,32883069..32883149,32883244..32883297,32883884..32883955))
Gene LOC18613845
GeneID 18613845
Organism Theobroma cacao

Protein

Length 359aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018126319.1
Definition PREDICTED: cathepsin B [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category O
Description Belongs to the peptidase C1 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00536        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
ko03110        [VIEW IN KEGG]
ko04147        [VIEW IN KEGG]
KEGG_ko ko:K01363        [VIEW IN KEGG]
EC 3.4.22.1        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04140        [VIEW IN KEGG]
ko04142        [VIEW IN KEGG]
ko04210        [VIEW IN KEGG]
ko04612        [VIEW IN KEGG]
ko04621        [VIEW IN KEGG]
ko04924        [VIEW IN KEGG]
map04140        [VIEW IN KEGG]
map04142        [VIEW IN KEGG]
map04210        [VIEW IN KEGG]
map04612        [VIEW IN KEGG]
map04621        [VIEW IN KEGG]
map04924        [VIEW IN KEGG]
GOs GO:0000323        [VIEW IN EMBL-EBI]
GO:0003674        [VIEW IN EMBL-EBI]
GO:0003824        [VIEW IN EMBL-EBI]
GO:0004175        [VIEW IN EMBL-EBI]
GO:0004197        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005576        [VIEW IN EMBL-EBI]
GO:0005615        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005764        [VIEW IN EMBL-EBI]
GO:0005773        [VIEW IN EMBL-EBI]
GO:0005829        [VIEW IN EMBL-EBI]
GO:0006508        [VIEW IN EMBL-EBI]
GO:0006807        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008152        [VIEW IN EMBL-EBI]
GO:0008233        [VIEW IN EMBL-EBI]
GO:0008234        [VIEW IN EMBL-EBI]
GO:0009056        [VIEW IN EMBL-EBI]
GO:0009057        [VIEW IN EMBL-EBI]
GO:0009987        [VIEW IN EMBL-EBI]
GO:0016787        [VIEW IN EMBL-EBI]
GO:0019538        [VIEW IN EMBL-EBI]
GO:0030163        [VIEW IN EMBL-EBI]
GO:0043170        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0044237        [VIEW IN EMBL-EBI]
GO:0044238        [VIEW IN EMBL-EBI]
GO:0044248        [VIEW IN EMBL-EBI]
GO:0044257        [VIEW IN EMBL-EBI]
GO:0044260        [VIEW IN EMBL-EBI]
GO:0044265        [VIEW IN EMBL-EBI]
GO:0044267        [VIEW IN EMBL-EBI]
GO:0044421        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0051603        [VIEW IN EMBL-EBI]
GO:0070011        [VIEW IN EMBL-EBI]
GO:0071704        [VIEW IN EMBL-EBI]
GO:0140096        [VIEW IN EMBL-EBI]
GO:1901564        [VIEW IN EMBL-EBI]
GO:1901565        [VIEW IN EMBL-EBI]
GO:1901575        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGAAGGACATGGCGAATCCCCTGCTCTTTTTGGCTAGCTTCCTGTTACTTCTCTCCACGGTTCACCCAAAGGTAATTGCTGTGGAACAACTTTCTGAGGTCAAGCTTAACTCTCAAATCCTTCAGGATTCAATTGTGAAACAAGTAAATGAAAATCCCAAGGCTGGATGGAAAGCTGCCCTGAATCCTCGACTTTCTAACTACACTGTTGGTGAGTTTAAGCATCTTCTTGGAGTCAAACCAACACCCAAGAAGGAACTTCTGGGTATTCCTGTTATAACACATGACAAATCCTTAAAGGTGCCAACTAAGTTCGATGCTAGAACAGCTTGGCCACAATGTAGCACAATCGGAAGAATTCTTGATCAGGGTCACTGTGGTTCTTGCTGGGCTTTTGGTGCTGTTGAATCACTATCTGATCGTTTCTGTATCCATTTTAGCATGAATATATCTCTGTCCGTTAATGATCTTCTGGCGTGCTGTGGCTTTTTATGTGGAAGTGGTTGTGATGGGGGGTATCCAATTTCTGCATGGCGATATTTTGTGCGCCGTGGTGTTGTCACTGAGGAGTGTGATCCATATTTTGATGATACTGGTTGTTCTCACCCTGGTTGTGAGCCTGCATATCCTACTCCCAGATGTGTTAAAAAGTGTGTTAAGGGAAACCAACTCTGGAGGGAGTCCAAGCACTATAGTGTCGGGGCGTACAGAATCAACTCTGATCCAGCTGATATCATGGCAGAAGTTTATAAGAATGGACCAGTTGAGGTCTCCTTCACCGTTTATGAGGATTTTGCTCACTACAAGTCAGGAGTTTACAAATATGTGACAGGCGGTGTCATGGGAGGTCATGCAGTTAAGCTTATTGGTTGGGGAACATCTGATGATGGGGAGGATTACTGGCTTCTTGCAAACCAGTGGAATAGAGGCTGGGGTGACGATGGCTACTTCAAGATTAGCAGAGGCACAAACGAGTGTGGTATTGAAGATGATGTCGTAGCTGGTTTGCCTTCTACCAAAAATCTTGTTAGAGAGGTAGGTGACATGGACACTCTCGAAGATGCTTTGTTTCGAGAGTAA
Protein:  
MKDMANPLLFLASFLLLLSTVHPKVIAVEQLSEVKLNSQILQDSIVKQVNENPKAGWKAALNPRLSNYTVGEFKHLLGVKPTPKKELLGIPVITHDKSLKVPTKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFSMNISLSVNDLLACCGFLCGSGCDGGYPISAWRYFVRRGVVTEECDPYFDDTGCSHPGCEPAYPTPRCVKKCVKGNQLWRESKHYSVGAYRINSDPADIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYVTGGVMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKISRGTNECGIEDDVVAGLPSTKNLVREVGDMDTLEDALFRE